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c/3 ' Abstract 
O 

We show examples of colored graphs G and H, both on n vertices, that 
in the existential two-variable logic can be distinguished only with quantifier 
^> . depth more than Yff^^- This bound is tight up to a constant factor. It contrasts 

\ with the fact that, if such graphs are distinguished in the unrestricted two- 

' variable logic, they can be distinguished with quantifier depth at most n. We 

CN ■ also discuss extensions of these results to logics with more variables and/or 

CSl ! with more quantifier alternations. In particular, we show that the quantifier 

alternation hierarchy for /c-variable logic is strict over colored trees. Without 
colors, this result holds true if A; > 3, while the alternation hierarchy for 
two- variables logic collapses even if considered over all uncolored graphs. 
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c3 : 1 Introduction 

Let FO denote the class of all first-order formulas and FO'^ be the subclass of 
formulas with occurrences of at most k variables. We will refer to FO'^ as k-variable 
logic. The metatheory of these logics is rich in interesting results, even in the case 
of /c = 2; e.g., [HI [15l [121 [131 [lOl [M]. Finite- variable logics have found important 
applications in algorithmic model theory and descriptive complexity theory; see 
books [21 [in [26] and survey articles [H [HI [131 [E]- This is true even for "small" 
fragments of FO'^. The existential-positive /c-variable logic is useful in heuristic 
analysis of the constraint satisfaction problem [25] and in estimating the complexity 
of resolution proofs [H [3]. Propositional modal logic can be viewed as a fragment 
of FO^; see [13]. The syntactic extension of FO^ with counting quantifiers captures 
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the known color- refinement procedure [22], an efficient way of grapli canonization 
tliat works for almost all graphs. 

We will consider a finite- variable logic over finite vertex-colored graphs (without 
directed edges), though our results for this class of structures carry over directly to 
directed vertex- and edge-colored graphs (i.e., to the structures over any vocabulary 
consisting of binary relations). Our aim is to investigate how expressibility of FO'^ 
depends on the number of quantifier alternations allowed in a formula. In order to 
make a comparison, we use a quantitative measure of the expressive power of a logic 
£ C FO'^, namely the quantifier depth needed to distinguish two given graphs in £. 
This parameter is intimately related to the computational complexity of the logical 
equivalence problem for £ [I8l|2] and, hence, to testing isomorphism of graphs [6|IT8]. 
Being a very natural descriptive complexity measure, this parameter appears also 
in other areas; for example, it corresponds to the depth of resolution refutations of 
width fc - 1 [HE]. 

More specifically, we say that a formula $ distinguishes a graph G from a 
graph H if it is true on G but false on H. By Dc{G,H) we denote the mini- 
mum quantifier depth of such $ over formulas in a logic C; we write Dc{G, H) = oo 
if G is indistinguishable from H in C. We will simplify the notation by writing 
D''{G,H) in place of DpQk{G,H) and D^^{G,H) in place of D^^-^p^k^G, H). Note 
that D''{G,H) = D^{H,G). 

Dawar, Lindell, and Weinstein [5] proved that D^{G,H) < v{GY + A; + 1 if G 
and H are distinguishable in FO'^. Here and thoughout v{G) denotes the number of 
vertices in the graph G. Immerman et al. [22] |6] discovered a connection between 
the parameter D^{G,H) and the number of color refinements needed for the k- 
dimentional Weisfeiler-Lehman procedure to distinguish G and H. By a careful 
analysis in [30], Oleg Pikhurko and the second author showed that this connection 
leads to a better bound, namely 

D^{G,H) <v{Gf-^ (1) 

if v{G) > 2; see Section |3] where this bound is proved in detail for /c = 2 (the 
treatment in ^0] does not cover this case and states a little bit worse bound than ([1])). 
If = 2, this bound is tight up to a constant factor (for example, D'^{Pn, Pn+i) > 
n/2)] whether this is so for A; > 3 is a challenging question. 

Bound ([T]) stays true for logic with counting quantifiers, for which we will use 
notation D^{G,H). A linear lower bound D^{G,H) = fl{v{G)) is shown by Fiirer 
[TT] for each k (for an infinite sequence of pairs of graphs G and H with the same 
number of vertices). 

The existential fragment Si fl FO*^ of fc- variable logic consists of formulas that 
contain only existential quantifiers and are in the negation normal form (i.e., any 
negation stands in front of a relation symbol). For the existential logic we will often 
write D^{G, H) in place of D^^ (G, H). It is well known that D^(G, if) < r iff Spoiler 
has a strategy allowing him to win the existential fc-pebble Ehrenfeucht game on G 
and H in r rounds. If G is distinguishable from H in the existential fc-variable logic, 
this equivalence immediately gives us the bound D^{G,H) < {v{G)v{H))'^, just 
because the number of possible positions in the game is bounded by {v{G)v{H)Y . 
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Figure 1: Spoiler has to make many passes through G because H is large. 

Looking at the connection between logical distiguishability and the Ehrenfeucht 
game a bit more carefully, in Section H] we prove a better bound 

Dl{G,H)<{v{G)v{H)f~\ (2) 

This result is obtained by a rather general game-theoretic argument, that works as 
well for other logics having accompanying games, in particular, for the existential- 
positive fc-variable logic. 

The upper bound ([2]) for the existential case is worse than the best upper bound 
([T]) for FO'^. One goal of our work is to show a gap between and . As a 
first step it can be shown that D^{G,H) is lower bounded by fl{v{H)''~^). Using 
the construction in [3] one can derive colored graphs G and H where Spoiler needs 
Q{v{H)^~^) to win the existential fc-pebble game. This observation is stated in the 
following theorem, whose proof is omitted here. 

Theorem 1.1 For every fixed k > 2 there is a colored graph G, a constant c = c{k) 
and an infinite family l-i of colored graphs such that for all H El-L 

c ■ v{Hf-^ < D'^{G,H) <oo. (3) 

For k = 2 a, simple example is shown in Fig. [T] The case of A; > 3 is, however, 
much more complicated. If compared to the upper bound ([T]), Theorem 11.11 shows a 
huge gap between the expressive power of fc- variable logic and its existential fragment 
in terms of the parameter D{G,H) if the graph G is small. However, Theorem 11.11 
does not rule out the possibility that for graphs G and H with v{G) = v{H) = n the 
difference disappears. We think that also in that case the upper bound is tight, 
but given the current status of our research we have to state this as conjecture. 

Conjecture 1.2 For every fixed k > 2 there are infinite many pairs of colored 
graphs G and H with v{G) = v{H) = n such that 

c ■ n^''-^ < D^{G,H) <oo. (4) 

for a constant c depending on k only. 

In this paper we settle the conjecture for the case k = 2. In Section [6] we 
construct infinitely many pairs of colored graphs G and H with v{G) = v{H) = n 
such that G is distinguishable from H in the existential two-variable logicjl] with 

DliG,H)>^n', 
^and even in the existential-positive logic. 
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which matches the upper bound ([2]) up to a constant factor in the case of = 2. 
One could wonder if it is necessary to consider colored graphs or if the lower bound 
also holds true for uncolored graphs. It turns out that the quantifier-rank of the 
existential two- variable logic drops down to linear over uncolored graphs. In Section 
Owe prove that 

Dl{G,H) < 2v{H) 

if G and H are uncolored. Even this particular case is non-obvious and requires 
a delicate analysis. On the other hand, our analysis shows that two-variable logic 
over uncolored graphs is much less complicated than over colored graphs: The dis- 
tinguishability problem for this logic turns out to be decidable in logarithmic space 
in the former case (Corollary 14.31) and is known to be P-complete in the latter case 
(Kolaitis and Panttaja [23]). 

Section [7] is devoted to higher levels of the quantifier alternation hierarchy. In 
the particular case of uncolored graphs we observe that, if such graphs G and H are 
distinguished in FO^, then 

Dl^iG,H)<viG). (5) 

This shows that the alternation hierarchy over uncolored graphs collapses. The 
general case of colored graphs is also here much more complicated. Assuming that 
a colored graph G is distinguishable from a colored graph if in E2 fl FO^, we know 
neither if stays true nor if there are G and H both with n vertices such that 
D| {G, H) = i7(r2^). At least, we are able to show that the alternation hierarchy of 
FO^ over colored graphs is strict: For every i > 1 there are uncolored trees Gi and 
Hi such that Gi is indistinguishable from Hi in n, fl FO^ while 

Dl{G,,Hi)<t + l. 

This results extends to the case of /c > 3 variables, even over uncolored graphs. 

Note that the quantifier alternation hierarchy has received a considerable atten- 
tion in the past. Weis and Immerman [52] consider FO^ over words in an alphabet 
A and show that the hierarchy collapses if A is finite and is strict if A is infinite. 
Bojahczyk and Segoufin [^ characterize tree languages definable in E2nn2 in terms 
of the descendant relation. Matz, Schweikardt and Thomas |27] establish that the 
alternation hierarchy of monadic second-order logic is strict over graphs, in fact, 
even over finite two-dimensional grids. 

We conclude the paper with discussion of open questions in Section [SI 

2 Preliminaries 

A (vertex-) colored graph is a relational structure over vocabulary a = {E, Ui, U2, . . .), 
where E denotes a binary, irrefiexive and symmetric, adjacency relation and Ui 
denotes a unary color relation. We will sometimes write x ~ y instead of E{x,y). 

A first-order formula $ over vocabulary a distinguishes a graph G from a graph 
if if $ is true on G but false on H. Note that, if $ distinguishes G from H, 
then -1$ distinguishes H from G. Furthermore, we say that $ distinguishes G and 
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H if it distinguishes G from H or H from G. A logic £ is a class of formulas. 
For example, the existential logic consists of formulas in the negation normal form 
(where any negation stands in front of a relation symbol) with only existential 
quantifiers. By Dc{G,H) we denote the minimum quantifier depth of a formula 
$ G £ distinguishing G from H. In order to make this notation always meaningful, 
we write Dc{G,H) = oo if G is indistinguishable from H by any formula in C. 
For the existential logic we use notation D^{G,H). Moreover, the superscript k 
in D^{G,H) will indicate that the /c- variable fragment of L is considered. The 
absence of the subscript will mean that the full first-order logic over vocabulary a 
is considered (i.e., £ = FO). 

The vertex set of a graph G is denoted by V{G) and is supposed to be nonempty 
and finite. The number of vertices in G will be denoted by v{G) = \V{G)\. The 
complement of G is the graph on the same vertex set V{G) with any two vertices 
adjacent if and only if they are not adjacent in G. 

The k-pebble Ehrenfeucht game on graphs G and H, denoted by Ehr'^(G', if ), 
is played by two players. Spoiler and Duplicator, to whom we will refer as he and 
she respectively. The players have equal sets of k pairwise different pebbles. A 
round consists of a move of Spoiler followed by a move of Duplicator. Spoiler takes 
a pebble and puts it on a vertex in G or in H. Then Duplicator has to put her 
copy of this pebble on a vertex of the other graph. Duplicator's objective is to 
keep the following condition true after each round: the pebbling should determine 
a partial isomorphism between G and H. The variant of the fc-pebble Ehrenfeucht 
game where Spoiler pebbles vertices always in G is called existential and denoted 
by EnRli{G,H). 

For each positive integer r, the r-round Ehr'^(G', if) (as well as Ehr^(G, if)) 
is a two-person game of perfect information with a finite number of positions. By 
Zermelo's theorem, either Spoiler or Duplicator has a winning strategy in this game, 
that is, a strategy winning against every strategy of the opponent. 

Lemma 2.1 Let s > 1. Then D'^{G,H) < s iff Spoiler has a winning strategy in 
the s-round game Ehr^ (G,ff). 

This well-known game characterization of existential logic generalizes to any finite- 
variable logic with bounded number of quantifier alternations; cf. |28j . 

3 Upper bound for the quantifier depth in FO 

Our definition of a colored graph does not exclude that a vertex can have none or 
more than one colors. It will be convenient to regard the set of the colors that a 
vertex u has as a new color C{u) (the mix of basic colors f/i, [/2, • • •)• The empty set 
of basic colors is allowed; thus, all originally uncolored vertices get colored in the 
same new color. 

We will use a variant of the well-known color refinement procedure. Given a 
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graph G, define a sequence of colorings of V{G) as follows: 

C%u) = C{u), 
C'+\u) = {C\u),{C\a): a^u},{C\a): a^u}). 

The following fact is a direct consequence of the connection between logical distin- 
guishability and the color refinement procedure discovered in [Gj 122]. 

Lemma 3.1 Let s>l. Then D^{G,H) < s iff 

{C'-\u) : ueV{G)] ^ {C'-\v) : veViH)}. (6) 

Theorem 3.2 Let G and H be colored graphs, both having at least two vertices. If 
they are distinguishable in two-variable logic, then 

D\G, H) < min {v{G),v{H)} . 

Since D\G,H) = D\H,G), it suffices to prove that D\G,H) < v{G). The 
proof is based on the well-known "color stabilization" argument; see [HI [22] or [30], 
Section 4]. Given a colored graph G, let Gq denote the partition of V{G) into 
the monochromatic classes of the coloring C*. Since C*^^(m) = C*^^(m') implies 
C*(m) = C*(-u'), the partition Gq~^ is a refinement of Gq. Refinement cannot always 
be proper and eventually stabihzes: for some i < v{G) we have = G^f^ and, 
hence, Gq = Gq for all j > i. Call an index s > 1 stable for G if G^^ = Gq. 

Lemma 3.3 If s is stable for G, then D'^{G, H) < s + 1 for any H distinguishable 
from G in the two-variable logic. 

Proof. We will need the following general notion. Given a partition P of V{G), 
the quotient graph GjP has P as the vertex set with A,BeP being adjacent if G 
contains an edge between a vertex in A and a vertex in B. Note that G/P can have 
loops. 

If Inequality ([6]) is true, then D^{G,H) < s by Lemma [3?T1 Suppose, therefore, 
that the two sets in IQ are equal. 

If s is stable for H too, consider the stable partitions P = G^^ and Q = G^^. 
Consider also the one-to-one map f : P ^ Q taking each element of P to the element 
of Q of the same C^~^-color. Note that / cannot be an isomorphism between the 
quotient graphs G/P and H/Q for else the color sets in would be equal not only 
for the index s but also for all subsequent indices, and we would have D'^{G, H) = oo 
by Lemma Em Therefore, P contains partition elements A and B that are adjacent 
in G/P while f{A) and f{B) are non-adjacent in H/Q or vice versa {f{A) and f{B) 
are adjacent while A and B are not). This implies that A and f{A) have different 
C"^-colors. If C G P and G ^ A, then A and f{G) also have different C'^-colors just 
because their C*~^-colors are different. It follows that 

{G'iu) : u G ViG)} ^ {G%v) : v G ViH)} (7) 

and we have D'^{G, H) < s + 1 by Lemma [3. 1[ 

It remains the case that s is not stable for H. Inequality ([7]) holds true because 
the set in the right hand side contains more colors. Thus, D^{G, H) < s + 1 in both 
cases. ■ 
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The color stabilization argument readily implies that n — v{G) is stable for any 
graph G. An improvement to n — 1 takes some work. 

Lemma 3.4 The index n — 1 is stable for every colored graph G on n > 2 vertices. 

Proof. If in the coloring C oi G there are at least two colors, then \Cq\ > 2, i.e., 
the partition Cq consists of two elements, and we are done because then at most 
n — 2 proper refinement steps are possible. Assume, therefore, that there is only one 
color or, equivalently, G is an uncolored graph. 

li n — 2, then G is either complete or empty. By this reason Cq — Cq, and the 
lemma is true. Assume, therefore, that n > 3. 

We will need some graph-theoretic terminology. A set of vertices M is called a 
module of G if any two vertices in M have the same adjacency pattern to the rest of 
the graph, that is, if x, 7/ e M and z ^ M, then x ~ 2; iff y ~ Two vertices x and 
y are called twins if they form a module or, equivalently, if the transposition of x 
and y is an automorphism of the graph. A vertex is isolated if it is not adjacent to 
any vertex. A vertex is universal if it is adjacent to all other vertices. Furthermore, 
we will call a vertex normal, if it is neither isolated nor universal. 

We will say that is a small refinement of Cq if \C^q^\ = \Cq\ + 1, that is, 
all but one elements of the partition Cq stay the same in Cq^ and the exceptional 
one is split up into two parts. 

Claim A. If Cq, Cq,...,Cq is a sequence of t small refinements and X e C^^ is 
split into Y and Z in Cq, then one of the two parts of X, say Y, consists of twins 
of G, and the other part, Z, is a module of G. 

Proof of Claim A. We proceed by induction on t. In the base case, when t = 1, 
we have Cq being a partition of X = V{G) into two parts, Y and Z. Note that in 
the coloring C^ there can appear three colors; let us call them /, U, and N. More 
specifically, all isolated vertices get the color I, all universal get U, and all normal 
get N. By assumption, exactly two of the three colors appear in C^. Obviously, 
the combination {I,U} is impossible. Suppose that the colors appearing in C^ are 
/ and (the case of {U, N} is similar; it is transformed to the case of {/, A^} by 
taking the complement of G). Let Y and Z consist of the isolated and normal 
vertices respectively. The claim is true because all isolated vertices are twins, and 
all non-isolated vertices form a module. 

Let t > 1. By the induction assumption, all but one elements of are twin- 
classes, and the exceptional element is a module. Since no twin-class can be split 
up, X is this exceptional element. Since X is a module, {Y, Z} coincides with the 
partition Cq^^^, where G[X] denotes the subgraph of G induced on X. By the 
argument used in the base case, Y consists either of isolated or of universal vertices 
of (^[X], and the remaining part Z is a module of GfX]. Referring once again to the 
fact that X is a module, we conclude that F is a twin-class of G and Z is a module 
ofG. <J 

Consider now the sequence of partitions Cq, Cq, . . . , Cq~'^. If it contains a pair 
= (7^^ then Cq'^ = Cg~^ = Cg"\ and the lemma is true. If there is a pair 
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Cq,Cq~'^ with IC^f-^l > \Cq\ + 2, then we are done by the color stabihzation argu- 
ment. In the only remaining case, the whole sequence consists of small refinements. 
Claim R] applies to each t < n — 2. If we have \Y\ > 2 for some t < n — 2, we are 
done because the twin-class Y cannot be split up later, and hence proper refinement 
terminates at the step n — 2 at latest. Assume that \Y\ = 1 for every t < n — 2. 
Then for t = n — 2 we have \Z\ = 2, and Z is a module of G. Therefore, Z consists 
of two twins. It follows that Cq~^ = Cq~'^, completing the proof of the lemma. ■ 

Remark 3.5 Lemma [3.41 cannot be further improved because the value of n — 1 is 
best possible for all graphs in the following sequence: Gi consists of two isolated 
vertices, and Gj+i is obtained by taking the complement of Gi and adding an isolated 
vertex to it. 

Theorem 13.21 follows directly from Lemmas 13.31 and 13.41 

We conclude this subsection with a brief discussion of the general case of k 
variables. The approach used here to prove Theorem 13.21 is borrowed from [30] , 
where we focused on the case of A; > 3. In [5U] we observed that D^{G,H) < 
Stab''-\G) + k, where Stab'^{G) denotes the minimum stable index for the Weisfeiler- 
Lehman coloring of V{GY. If d > 2, this coloring is from the very beginning not 
monochromatic. More specifically, in the beginning V{GY is partitioned into the 
types of (i-tuples with respect to equality and adjacency relations. It follows that 
Stah'^iG) < n'^ — \C^\ + 1, where \C'^\ denotes the size of the initial coloring of V{Gy. 
If n > d, the number of equality types is equal to the Bell number Bd, which suffices 
to derive the bound ([1]) if /c > 4. If k = 3, counting the equality types alone does 
not suffice but then it is enough to notice that, unless G is complete or empty, all 
but one equahty types are split further by taking adjacency into account. 

4 Upper bound for existential logic: Uncolored 
graphs 

Theorem 4.1 Let G and H be uncolored graphs. If G is distinguishable from H in 
the existential two-variable logic, then 

Dl{G,H) < 2v{H). 

The proof of Theorem 14.11 takes the rest of this subsection. We will actively 
exploit the following graph-theoretic notions. 

We call a graph normal if all its vertices are normal (i.e., it has neither isolated 
nor universal vertex). Note that a graph is normal iff its complement is normal. 
Let K be an induced subgraph of a graph G. If is a normal graph, we call it a 
normal subgraph of G. Furthermore, we define a class of abnormal graphs of rank i 
inductively as follows: 

• disconnected abnormal graphs of rank 1 are exactly empty graphs with at least 
2 vertices; 
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• a disconnected abnormal graph of rank i + 1 is a graph that consists of a 
connected abnormal graph of rank i and a number of isolated vertices; 

• connected abnormal graphs of rank i are exactly complements of disconnected 
abnormal graphs of rank i. 

The rank of an abnormal graph G will be denoted by rkG. Note that examples of 
abnormal graphs already appeared in Remark 13.51 above. 

Lemma 4.2 Let G be a graph with n >2 vertices. Then G has no normal subgraph 
iff G is abnormal. 

Proof. In order to prove the lemma in the forward direction, we use induction on n. 
Since G itself is not normal, it has either an isolated or an universal vertex. W.l.o.g., 
assume the former (otherwise consider G). If all vertices in G are isolated, which is 
so in the base case of n = 2, this is an abnormal graph of rank 1. Otherwise, let G' 
be obtained from G by removing all isolated vertices. G' is an induced subgraph of G 
and, hence, has no normal subgraph. By the induction assumption, G' is abnormal. 
By construction, G' has no isolated vertex. Moreover, G' is connected for else it 
would have also no universal vertex and, hence, would be a normal subgraph of G. 
Therefore, G is abnormal with rkG = rkC + 1. 

For the backward direction, we use induction on i = ikG. In the base case 
of i = 1, the graph G is either empty or complete and does not have any normal 
subgraph indeed. Let i > 1. Assume that G is disconnected (otherwise consider G). 
Then G consists of an abnormal graph G' of rank i — 1 and some isolated vertices. 
If an induced subgraph K of G contains at least one of the latter, this is an isolated 
vertex in K, and K is not a normal graph. Otherwise, K is an induced subgraph of 
G' and cannot be a normal graph by the induction assumption. ■ 

Suppose that G is an abnormal graph. We now introduce a ranking of vertices in 
G. If G is disconnected, it has at least one isolated vertex; if G is connected, there 
is at least one universal vertex. Denote the set of such vertices by dG. Every vertex 
in dG is assigned rank 1. In particular, if rkG = 1, then all vertices of G have rank 
1. If u ^ dG, then it is assigned rank one greater than the rank of u in the graph 
G — dG. The rank of a vertex m in G will be denoted by rku. It ranges from the 
lowest value 1 to the highest value rkG. 

Note that a vertex u has the same adjacency to all other vertices of equal or 
higher rank. If u is adjacent to all such vertices, we say that u is of universal type; 
otherwise we say that it is of isolated type. Note that the type gets flipped when the 
rank changes by one. 

Let us turn to Theorem 14.11 By Lemma 12.11 we have to show that if Spoiler 
has a winning strategy in FjKR^{G, H) for some number of rounds, then he is able 
to win fast enough, that is, in at most 2v{H) rounds. We split our analysis into 
several cases. 

Case I: H has a normal subgraph K . Duplicator can win any number of rounds 
by playing inside K. Whatever vertex v G V{K) is pebbled in the preceding round, 
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she is able to find another vertex v' G V{K), no matter whether it has to be adjacent 
to V or not. 

From now on we will assume that H has no normal subgraph. By Lemma 14.21 
H is abnormal. 

Case II: G has a normal subgraph K while H is an abnormal graph. Spoiler has 
a strategy allowing him to win within rkif + 1 < v{H) rounds. He always pebbles 
vertices inside K. Suppose that a pair of vertices (m, v) G V{K) x V{H) is pebbled in 
the preceding round. If v is of universal type, Spoiler pebbles a G V{K) nonadjacent 
to u. Duplicator is forced to respond with a vertex h nonadjacent to f , which implies 
that rk6 < ikv. If v is of isolated type, Spoiler pebbles a G V{K) adjacent to u and 
again forces pebbling h G V{H) of lower rank. If Duplicator respects adjacency all 
the time. Spoiler eventually forces rk6 = 1. He plays the next round in the same 
way, demanding the adjacency condition opposite to the adjacency that h has to the 
rest of the graph H, and wins. We will refer to this kind of strategy, that forces 
pebbling a vertex of always lower rank, as a force- out strategy. 

From now on we will assume that, like H, the graph G has no normal subgraph. 
By Lemma [4. 2 [ G is also abnormal. 

Case III: Both G and H are abnormal. This case is most complicated and 
requires a finer analysis. 

Claim B. Suppose that the players pebble a pair of vertices (m, v) G V{G) x V{H). 
If rk-u > rkf or if rkw = rkw and these vertices are of different types (i.e., one is 
of isolated type while the other is of universal type), then Spoiler is able to win no 
later than in the next rk v moves. 

Proof of Claim B. Spoiler has a force-out strategy. We describe it recursively and 
simultaneously prove the claim by induction on ikv. 

If rkf = 1, the vertex v is either isolated or universal in H. Note that the latter 
case is obtained from the former by considering the game on G and H, which is 
virtually the same. Suppose, therefore, that v is isolated. If rkw > 1, the vertex u 
is not isolated in G. If rkw = 1, then u is not isolated as well (by assumption, it 
is universal in G). In the next round. Spoiler pebbles a vertex a adjacent to u and 
wins. 

Let rki; > 2. Assume first that u and v are of the same type. Thus, rku > rkv. 
W.l.o.g., suppose that both u and v are of isolated type. In the next round Spoiler 
pebbles a vertex a of rank iku — 1. Since a is of universal type, it is adjacent to u. 
Since v is nonadjacent to all vertices of equal or heigher rank. Duplicator is forced 
to pebble a vertex 6 in if of rank rk6 < ikv. Since rk6 < rka. Spoiler wins in the 
next rk 6 < rk — 1 moves by the induction assumption. 

Assume now that u and v are of different types. W.l.o.g., suppose that u is of 
isolated type while v is of universal type. If rkw < rkG, Spoiler pebbles a vertex a 
of rank rkw + 1. If rk-u = rkG, Spoiler pebbles another vertex a of the same rank 
rkG (there are always at least two vertices of the highest rank). In either case, a is 
nonadjacent to u. This forces Duplicator to pebble a vertex b of rank rk6 < rkv. 
Now rk6 < rka and, again by the induction assumption. Spoiler wins in the next 
rk 6 < rk — 1 moves. < 
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Case Ill-a: ikG > ikH or ikG = ikH with one graph connected and the 
other disconnected. Spoiler is able to win within rkH + 1 < v{H) rounds. In the 
first round, he pebbles a vertex u of rank rk if + 1 or of the highest rank rk H if 
rkG = ikH. Whatever Duplicator's response v is, its rank is at most rkH. Since 
the pair {u, v) meets the conditions of Claim[Bl Spoiler wins in the next ikH moves. 

Case Ill-b: ikG = ikH = r and the graphs are both connected or both discon- 
nected. 

We will say that G is distinguishable from H at rank I ii G has at least two 
vertices of rank £ while H has only one vertex of this rank. We split Case Ill-b 
further depending on whether or not such I exists. 

Case III-b-1: G is indistinguishable from H at any rank i. Duplicator has 
a simple winning strategy, no matter how long the game is played. She always 
responds with a vertex having the same rank as the vertex pebbled in this round by 
Spoiler. 

Case III-b-2: G is distinguishable from H at some rank i. Spoiler has a strategy 
allowing him to win within 2r + 2 < 2v{H) rounds. In the first two rounds he 
pebbles two vertices of rank Since there is only one vertex of this rank in H, 
this forces pebbling a pair (u, v) G V{G) x V{H) such that rku = i while rkv i. 
If rk f < i, Spoiler applies the force-out strategy of Claim |B] and wins in at most 
rkf<£ — l<r — 1 extra moves. If rkf > i, then v has higher rank than u. Spoiler's 
objective is now to increase the rank of a vertex pebbled in G, making it greater 
than the rank of the corresponding vertex in H; this will make the force-out strategy 
applicable again. This objective is achieved by the overtaking strategy described in 
the proof of the following claim. 

Claim C. Once a pair {u, v) with rku < rkf is pebbled. Spoiler is able in at most 
r — rkf + 2 moves to force pebbling a pair {u', v') such that rku' > ikv' . 
Proof of Claim C. Consider first the case that u and v are of different types. W.l.o.g., 
let u be of isolated type and v be of universal type. Spoiler pebbles a vertex u' of 
rank rkf. Note that u and u' are not adjacent. Since in H all vertices of higher or 
equal rank are adjacent to f , Duplicator is forced to pebble a vertex v' of rank less 
than rkf = rku'. 

Now, consider the case that u and v are of the same type. Note that then 
iku < rkf — 2. W.l.o.g., suppose that both u and v are of isolated type. In the 
next round. Spoiler pebbles a vertex a of rank rkf — 1. Note that a and u are not 
adjacent, and a is of universal type. Let h denote Duplicator's response. 

If rk 6 < rk a. Spoiler's objective is achieved. If rk 6 = rk a = rk f — 1, then h is of 
universal type and, hence, adjacent to f ; Duplicator is dead. Let rk6 = rka -|- 1 = 
rkf. Though we still have rka < rk6, the vertices a and h are now of different 
types, and Spoiler overtakes Duplicator in one extra move. Finally, assume that 
rk6 > rkf. If a and h are of different types. Spoiler needs only one extra round as 
above. If a and h are of the same type. Spoiler plays the next round in the same 
way. It remains to notice that rk h can increase at most r — rk f times. < 

Let us turn to analysis of Subcase III-b-2. If rkf < rkf, then Spoiler applies 
the overtaking strategy. By Claim [CI in at most r — rkf-|-2 < r moves he forces 
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pebbling a pair («', f ') with iku' > ikv' and then wins in at most ikv' < r extra 
moves by using the force-out strategy. 

Case III-c: rkG = rkH — 1 = r. W.l.o.g., suppose that H is disconnected. Let 
H' denote the graph obtained from H by removal of all isolated vertices. Thus, H' 
is a connected abnormal graph of rank r. 

Case III-c-1: G is connected (while H is not). This case is essentially equivalent 
to Case Ill-b for the graphs G and H' . If G is indistinguishable from H' at any 
rank, then Duplicator survives in EhR3(G, if') arbitrarily long, and the same holds 
true for YjUR^^G^H) (Duplicator just plays always in H'). 

Assume that G is distinguishable from H' at some rank £. Like as in Case 
III-b-2, Spoiler wins EHR^(G',if') in at most 2r + 2 moves. Applying the same 
strategy in EhR3(G, if), within the same number of rounds Spoiler either also wins 
(if Duplicator plays inside H') or forces pebbling a pair of vertices {u,v) G V{G) x 
V{H) such that v is isolated in H (once Duplicator steps outside H'). In the latter 
case Spoiler pebbles a neighbor of u and wins. He can always do so, as G has no 
isolated vertex. Alltogether, Spoiler needs at most 2r + 3 < 2v{H) rounds to win. 

Case III-c-2: G is disconnected (like H). Duplicator has a strategy allowing her 
resist arbitrarily long. 

Suppose that Spoiler pebbles a vertex u. We say that Duplicator's move v in 
response is straight ii ikv = iku. We call it shifted if rkf = rk-u + 2 (hence 
rkti < rkG) or if rkw = rkw + 1 while iku = rkG. The pair of vertices {u,v) will 
be called straight or shifted correspondingly. 

In the first round Duplicator makes a straight move. In general, suppose that a 
pair (m, v) was pebbled in the preceding round, and Spoiler pebbles a G V{G) in the 
current round. We will describe how Duplicator chooses her response b G V{H). 

Assume that [u, v) is a straight pair. Whenever possible. Duplicator's move b is 
also straight. Note that making another straight move immediately after a straight 
move is never loosing for Duplicator. Playing so is impossible in the only case when 
a 7^ u, Tku = rkf, and G is distinguishable from H at this rank. In this case 
Duplicator makes a shifted move, which is nonlosing also in this case because u and 
V are of the same type while both rka > rkw and rk6 > rkf, hence a ~ m iff 6 ~ f . 

Assume now that {u,v) is a shifted pair. If rka = rkw and a ^ u, Duplicator 
makes a straight move. She does not lose because a and b are of the same type while 
both rk-u > rka and rkf > rk6. If rka 7^ rk-u. Duplicator makes a shifted move 
and does not lose. Indeed, w.l.o.g. suppose that rka > rk-u. Then rkt; = rk-u + 2 
and rk6 > rkf (even when rkfe = rka + 1). Since u and v are of the same type, we 
have a ~ u iff 6 ~ f . 

Case Ill-d: rkG < ikH — 2. Duplicator has a winning strategy for any number 
of rounds because H has an induced abnormal subgraph if' of rk if = rk G + 1 and 
of the same connectivity as G. Duplicator always plays in H', following her winning 
strategy in EHR3(G,ii') as described above in Case III-c-2. 

The proof of Theorem 14.11 is complete. As a by-product, it gives us also the 
following result. 
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Corollary 4.3 The problem of deciding whether or not one uncolored graph is dis- 
tiguishable from another in the existential two-variable logic is solvable in logarithmic 
space. 

Proof. Given uncolored graphs G and H, we actually have to decide if there is 
r such that Spoiler has a winning strategy in the r-round game E}iRl{G,H). It 
suffices to determine which of the cases 1-111 and which of their subcases takes place 
for G and H. This task easily reduces to the following problem: Given a graph G, 
we have to recognize if it is abnormal and, if so, we should be able to compute rank 
and type of each vertex of the graph. It turns out that this is not much harder than 
computing vertex degrees. 

Note ffist that the degree of a vertex of isolated type increases together with 
its rank. The degree of a vertex of universal type decreases as its rank increases. 
Moreover, any vertex of universal type has larger degree than any vertex of isolated 
type. It suffices to check this inequality between vertices of different types that have 
the two highest possible ranks. Let iku = rkC and rkf = rkG — 1 and suppose 
that u is of isolated type. Then the degree of u is equal to the number U of vertices 
of universal type in G while the degree of v is one smaller that the sum of U and 
the number R of all vertices of the highest rank rk G. If u is of universal type, then 
degM = U — 1 while degv = U — R. Since i? > 2, we have degv > degu in the 
former case and degu > degv in the latter case. 

These observations suggest the following algorithm for recognition of abnormal 
graphs and, in passing, for computing ranks of vertices in an abnormal graphs and 
determining their types. If an input graph G has neither isolated nor universal 
vertex, it is not abnormal. If G has a universal vertex, it suffices to check if G is 
abnormal. Suppose, therefore, that G has an isolated vertex. We ffist compute all 
vertex degrees. Suppose that we obtain d different degree values. Classify a vertex 
as being of isolated type if its degree is among smallest values and as being of 
universal type otherwise. Assign odd ranks to vertices of isolated type and even 
ranks to vertices of universal type by the following rules: The larger degree a vertex 
has, the higher rank it receives if it is of isolated type and the lower rank it receives if 
it is of universal type. Finally, we have to check if the following adjacency condition 
is obeyed: Vertices of isolated type are nonadjacent to all vertices of equal or higher 
rank, while vertices of universal type are adjacent to all vertices of equal or higher 
rank (in particular, vertices of the same degree are twins). ■ 

Remark 4.4 The analysis of Case 11 in the proof of Theorem 14 . 1 1 leads to a monochro- 
matic analog of the example shown in Fig. [H Let if„ be an abnormal graph of 
rank n — 1 on n vertices. Fix an arbitrary normal graph, for example, P4. Then 
Dl{P,,H^) = n. 
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5 Upper bound for existential logic: Colored graphs 



Theorem 5.1 Let G and H be colored graphs. If G is distinguishable from H in 
the existential two-variable logic, then 

Dl{G,H) <v{G)v{H). 

Proof. By Lemma 12.11 we have to prove that, if Spoiler has a winning strategy 
in the s-round game EnRl{G,H) for some s, then he has a winning strategy in the 
game with v{G)v{H) rounds. 

The proof is based on a general game-theoretic argument. Consider a two-person 
game, where the players follow some fixed strategies and one of them wins. Then the 
length of the game cannot exceed the total number of all possible positions because 
once a position occurs twice, the play falls into an endless loop. Here it is assumed 
that players's strategies are non-adaptive, that is, that a strategy of a player maps 
a current position (rather than the whole sequence of previous positions) to one of 
the positions available for the player. We have to show that we really can use this 
assumption for the Ehrenfeucht game. Moreover, since the number of positions in 
EnR^(G,H) is {v{G)v{H)y, we have to "reduce" it. To this end, instead of the 
space f (G)^ x v{H)'^ of positions in the 2-pebble Ehrenfeucht game, we will consider 
the space v{G) x v{H) of configurations. The idea appears aleady in the standard 
proof of Lemma 12.11 the players can play the 2-pebble Ehrenfeucht game optimally 
by using strategies that are non-adaptive in a rather strong sense. The strong non- 
adaptiveness means here that the next move is computed based on the outcome of 
the preceding round (which is only a part of the current position formed by the 
outcome of the two preceding rounds). 

Given u G V{G) and v G V{H), let r^^v denote the smallest r > such that 
Spoiler has a winning strategy in the r-round game Ehr^(G, m, if, f ), where the 
vertex pair (m, v) is pebbled before the game starts. Here ru,v = if even the initial 
configuration {u, v) is already losing for Duplicator, that is, u and v have different 
colors. Furthermore, we fix a strategy Su.v winning for Spoiler in the t,-round 
EurI{G,u,H,v). 

Claim D. Suppose that r„^^ > 1. Let a be the vertex of G that Su^y brings forth in 
the first round of Err^^G^u, H,v). Then, for every b G V{H), we have r^^ft < r^^v 

Proof of Claim D. If we had ra^b > i^u,v, Duplicator could win the r„ ,u-round 
Ehr^(G, u, H, v) by pebbling b in the first round. < 

Define a strategy S for Spoiler as follows. 

1-ST ROUND. Spoiler pebbles a vertex ui G V{G) such that ma:Xyj^v(H)fu.w is as 
small as possible (we fix such ui arbitrarily if there is more than one choice). 

{i + 1)-TH ROUND. If the pair {u, v) G V{G) x V{H) was pebbled in the i-th round. 
Spoiler places the other pebble on the vertex a produced by Su,v in the first 
round of EurI{G, m, H, v). 
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We now argue that S* is a winning strategy for Spoiler in the v{G)v{H)-Tound 
game EnR^{G,H). Indeed, let Spoiler follow S, fix a strategy for Duplicator arbi- 
trarily, and denote the outcome of the i-th round by (uj, Vi). Assume that Duplicator 
survives in t rounds and consider the sequence 

{Ui,Vi),{u2,V2),...,{Ut,Vt). (8) 

Thus, we have ru^^m > for alH < t and, by Claim [D| we also have r„-,„. > r^-^^^u-^j 
for all i < t. It follows that all pairs in the sequence (j8]) are pairwise different. If 
at least two colors appear in the graphs, then at least one pair [u, v) must have 
i^u,v = 0. Therefore, not all pairs from V{G) x V{H) can appear in the sequence 
([8]), and we have t < v{G)v{H) — 1, which implies the lemma. 

Suppose now that all vertices G and H are equally colored. Since this is equiv- 
alent to the uncolored case. Theorem 14.11 applies and gives us D'^{G,H) < 2v{H), 
which stays whithin the bound v{G)v{H) unless v{G) = 1. In the last case the 
theorem is trivially true just because the single- vertex graph G is indistinguishable 
from any H in FO^. ■ 

Theorem 15.11 can easily be extended to any fc-variable logic. 

Theorem 5.2 Let k > 3. If a graph G is distinguishable from a graph H in the 
existential k-variable logic, then 

Dl{G,H)<{v{G)v{H)f'\ 

Proof. In order to adapt the proof of Theorem 15.11 for the general case of k 
variables, we consider tuples of vertices u G V{G)'^~^ and v G V{H)''~^ and again 
define r„ „ to be the smallest r such that Spoiler has a winning strategy in the r- 
round Ehrenfeucht game v) on G and H with the initial configuration 

{u,v) X V{G)^^^ X V{H)^~^. We can suppose that in the first round Spoiler puts 
all k pebbles on the same vertex. Assume that the {i — l)-th round has been played 
with fc-tuples of vertices u G V{G)^ and v G V{H)^ being pebbled. Given j < k, 
let aju G V{G)'^~^ denote the {k — l)-tuple obtained from u by removal of the j-th 
coordinate. In the i-th round, the processs of making decision by Spoiler can be 
split into two steps. 

1. He first decides which of the k pairs of pebbles will play in the next round. 

2. If the j-th pair is chosen, then Spoiler actually plays the first round of the 
game FiUR'^(G,Ui, H,Vi), where Ui = aju and Vi = ajv. 

Assume that Spoiler plays optimally, that is, in Step 1 he chooses j minimizing the 
value Tui^vi, and in Step 2 he invokes a winning strategy for the r^.^^.-round game 
ERR'^{G,Ui, H,Vi). Then the elements of the sequence ([8]) are pairwise different. 
This implies that t < {v{G)v{H))'^~^ — 2 because not all pairs from V{G)'^~^ x 
V{H)^~^ can occur in ([8]); for example, none of the pairs {u, v) where u and v have 
different equality types. ■ 
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Figure 2: Proof of Theorem 16. 1[ The only way for Spoiler to win is to force pebbling 
the pair of vertices (oq, a'^) in a round. Duplicator can force Spoiler to make many 
passes around Gm before this goal is achieved. 

6 Lower bound for the existential two-variable 
logic 

Theorem 6.1 There are infinitely many colored graphs G and H , both on n vertices, 
such that 

DliG,H)>^^n' 

while G is distinguishble from H in the existential-positive two-variable logic. 

Proof. Our construction will depend on an integer parameter m > 2. We construct 
a pair of colored graphs Gm and Hm such that Gm is distinguishable from Hm in 
the existential two-variable logic, both v{Gm) = 0{m) and v{Hm) = 0{m), and 
D^{Gm, Hm) = Though v{Gm) < v{Hm), later we will be able to increase 

the number of vertices in Gm to v{Hm)- 

The graphs have vertices of 4 colors, namely apricot, blue, cyan, and dandelion. 
Gm. contains a cycle of length 3(2m — 1) where apricot, blue, and cyan alternate 
in this order; see Fig. [2J Hm contains a similar cycle of length 3 ■ 2 m. Successive 
apricot, blue, and cyan vertices will be denoted by a^, hi, and Cj in Gm, where 
< i < 2m — 1, and by a[, b[, and c[ in Hm, where < i < 2m — 1. Furthermore, 
the vertex oq is adjacent to a dandelion vertex do, and every a'j except for i = m is 
adjacent to a dandelion vertex d'^. This completes the description of the graphs. 

By Lemma [2. H we have to show that Spoiler is able to win FiUR'^{Gm, Hm) and 
that Duplicator is able to prevent losing the game for f2(m^) rounds. 

Note that, once the pair (ao,a'^) is pebbled. Spoiler wins in the next move by 
pebbling do. He is able to force pebbling {ao,a'm) as follows. In the first round 
he pebbles ao. Suppose that Duplicator responds with a'^, where < s < m. In 
a series of subsequent moves. Spoiler goes around the whole circle in Gm, visiting 
C2m-2, b2m-2, 0'2m-2, C2m-2, ■ ■ ■ {H m < s < 2m, he docs the same but in the other 
direction). As Spoiler comes back to ao. Duplicator is forced to arrive at a's+i. The 
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next Spoiler's tour around the circle brings Duplicator to a'^_,_2 and so forth. Thus, 
the most successful moves for Duplicator in the first round is Oq. Then Spoiler needs 
to play 1 + m(2m — 1) + 1 = 2m^ — m + 2 rounds in order to win. 

Since Spoiler keeps the adjacency relation true in every round, he wins even 
the existential two-pebble game, where Duplicator's objective is relaxed to keeping 
a partial homomorphism. It follows by [23] that G is distinguishble from H in the 
existential-positive two-variable logic. 

Our next task is to design a strategy for Duplicator allowing her to survive f2(m^) 
rounds, no matter how Spoiler plays. The idea is actually similar to the idea beyond 
the simple example depicted by Fig. [TJ Duplicator is able to force Spoiler to pass 
around the cycle in Gm many times. A crucial observation is that {aQ,a'^) is the 
only pair whose pebbling allows Spoiler to win in one extra move. 

Let us regard the additive group as a cycle graph with i and j adjacent iff 
i — j = ±1. Denote the distance between vertices in this graph by A. The same 
letter will denote the following partial function A : V{Gm) x V{Hm) — ?■ For two 
vertices of the same color, say, for and a'j, we set A(aj,a^) = A{i,j). Note that 
A(ao, a'^) = m, which is the largest possible value. Duplicator' strategy will be to 
keep the value of the A-function on the pebbled pair as small as possible. 

Specifically, in the first round Duplicator responds to Spoiler's move x with 
pebbling a vertex x' such that A(x, x') = (that is, if x = ai,bi,Ci,do, then x' = 
a[,b[,c[,d'Q respectively). Suppose that a pair {y,y') is pebbled in the preceding 
round and Spoiler is still alive. If Spoiler pebbles x in the current round. Duplicator 
chooses her response x' by the following criteria. 

• x' should have the same color as x and, moreover, x' ^ y' iS x ^ y (this is 
always possible unless {y, y') = (oq, a^) and x = do); 

• if there is still more than one choice, x' should minimize the parameter A(a:, x'). 

We do not consider the cases when x = y oi when x is pebbled by the pebble removed 
from y because, in our analysis, we can assume that Spoiler uses an optimal strategy, 
allowing him to win 'EjWR^iGm, y, Hm, y') in the smallest possible number of rounds 
(if he does not play optimally. Duplicator survives even longer). 

Claim E. \i x ^ y and x ^ y, then A(a;, x') < 1. 

Proof of Claim E. Assume first that x ^ d^ and y ^ d^. W.l.o.g., suppose that 
y and y' are apricot and, specifically, y' = a'j (the blue and the cyan cases are 
sjTumetric to the apricot case). Not to lose immediately. Duplicator cannot pebble 
x' in {c'j_i,a'j,b'j}, where j — 1 is supposed to be an element of Z2m- This can 
obstruct attaining A{x,x') = (if x G {cj_i, a^, 6j}), but then there is a choice of 
x' with A(x,x') = 1. 

Assume now that x = d^. Then x' = dg if y' ^ a'g and x' = d!^ otherwise. In both 
cases A(x, x') < 1. 

Finally, let y = do and y' = d'j. Then the value x' = a'j is forbidden and, if this 
prevents A(x,x') = 0, then we have A(x,x') = 1. < 

Consider now the dynamical behaviour of A(x,x'), assuming that Duplicator 
uses the above strategy and Spoiler follows an optimal winning strategy. We have 
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Figure 3: A bicolored modification of tlie example in Fig. |2l 



A[x, x') = at tfie beginning of the game and A(x, x') = m at the end (that is, in 
the round immediately before Spoiler wins). Consider the last round of the game 
where A(x, x') < 1. By Claim[El starting from the next round Spoiler always moves 
along an edge in Gm- Note that, from now on, visiting do earlier than in the very 
last round would be inoptimal. Therefore, Spoiler walks along the circle. Another 
consequence of optimality is that he moves always in the same direction. 

By symmetry, we can suppose that Spoiler moves in the ascending order of 
indices. Note that A{x,x') increases by 1 only under the transition from x = a2m-2 
to X = ao (at point, the index of x makes a jump in Z2m, while the index of x' moves 
along always continuously). In order to increase A{x, x') from 1 to m, the edge 
a2m-2'2o must be passed m — 1 times. It follows that, before Spoiler wins, the game 
lasts at least 2 + (m — 2) ■ 3 (2m — 1) = 6m^ — 15m + 8 rounds. 

Note that v{Gm) = 6m — 2 and v{Hm) = 8m — 1. In order to make the number of 
vertices in both graphs n = 8m— 1, let m be multiple of 3 and add two new connected 
components to Gm, namely the cycle of length 2m with alternating colors apricot, 
blue, and cyan and one isolated vertex af any color. Spoiler can still win by playing 
in the old component. Since playing in the new components does not help him, the 
game on the modified Gm and the same Hm lasts at least 6m^ — 15m+8 = ^ n'^—0{n) 
rounds. ■ 

Remark 6.2 In order to facilitate the exposition, the construction of graphs Gm 
and Hm uses 4 colors. In fact, the same idea can be realized with 2 colors. Ap- 
propriately modified Gm and Hm. are shown in Fig. [31 The larger graph Hm has 
now n = 10m — 1 vertices, and the game lasts more than — 0{n) rounds. For 
simplicity of presentation, we did not try to optimize neither the constant factor of 
^ here nor the factor of ^ in Theorem 16. II However, two colors are optimal in view 
of Theorem 14.11 

Remark 6.3 Theorem 16.11 has no analog for the logic with counting. Indeed, let 
G and H be distinguishable in F0# and v{G) = v{H) = n. Then D^{G,H) = 
D^^{G, H) because the universal quantifier can be replaced with the counting quan- 
tifier 3"'. It remains to note that D'^{G, H) < n similarly to Theorem 13.21 
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7 Quantifier alternation hierarchy 



Rephrasing Remark 16.31 one can say that the quantifier ahernation hierarchy of 
FO^ collapses over pairs of graphs with the same number of vertices. Let us look 
at the standard logic without counting. 

For each i > 1, let (resp. Ilj) denote the set of first-order formulas $ in 
the negation normal form with the following property: Any sequence of nested 
quantifiers in $ has at most i — 1 quantifier alternations and begins with 3 (resp. 
V) if it has exactly i — 1 quantifier alternations. By definition, Sj U Ilj C Sj+i fl 
Ilj+i and Si is existential first-order logic. By the quantifier alternation hierarchy 
we mean the interlacing chains Si C S2 C and Hi C 112 C We are interested 
in the corresponding fragments of a finite-variable logic. Sometimes we can write 
Dl^{G,H) for Di^{G,H) and D^^{G,H) for D^^{G,H). 

If > 3, we will see that the alternation hierarchy of FO'^ is strict even over 
uncolored trees. If k = 2, the colored and the uncolored cases are different. While 
the alternation hierarchy of FO^ is still strict over colored trees, over uncolored 
graphs it collapses in the sense made precise in the following result. 

Theorem 7.1 If uncolored graphs G and H are distinguished in FO^ , then they are 
distinguished in its T,2-fragment with quantifier depth at most max{v{G),v{H)} . 

Proof. We make use of our analysis done in the proof of Theorem 14.11 It gives us 
a complete description of the pairs of graphs with D^{G, H) < 00 and, by swapping 
G and H, also a complete description of the pairs with D^{G, H) < 00. In all these 
cases, exept III-b-2 and the similar Case III-c-1, we have 

mm{Dl{G,H),D^{G,H)} < m8ix{v{G),v{H)}. 

In the two exceptional cases we have a twice worse bound because Spoiler may need 
to use the overtaking strategy before he runs the force-out strategy. However, the 
overtaking phase is not needed at all if Spoiler is allowed to jump to the other graph. 
This observation shows that we have 

mm{Di,{G,H),D^^{G,H)} < max{v{G),v{H)} (9) 

whenever mm{Dl{G, H), D^{G, H)} < 00. 

The last condition is not fulfilled in two cases. First, this happens if both pairs 
{G,H) and {H,G) fall into Case III-b-1. In this case we actually have D^{G,H) = 
00. The other situation when this happens is the subcase of Case I where both G 
and H have normal subgraphs. In the rest of the proof we show that also here we 
have the bound unless D'^{G,H) = 00. 

It suffices to prove that, unless D'^{G, H) = 00, the graphs G and H are distin- 
guishable in S2nF0^ with quantifier depth at most v{G). We proceed by induction 
on v{G). Note that all graphs with less than 4 vertices are abnormal and, hence, 
v{G) = 4 in the base case. 

If both G and H are normal, then D^[G,H) = 00. Note that this includes the 
base case because the only way for a graph on 4 vertices to have normal subgraphs 
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is to be normal itself. If exactly one of G and H is normal, that is, the graphs are 
distinguished by the existence of an isolated or a universal vertex in one of them, 
then they are clearly distinguished in S2 fl FO^ with quantifier depth 2. Suppose, 
therefore, that none of G and H is normal. If one of them is connected and the 
other is not, then the graphs are again distinguished in E2 fl FO^ with quantifier 
depth 2. It remains to consider the case that G and H are both connected or both 
disconnected. W.l.o.g., assume the latter, i.e., that both G and H have isolated 
vertices. 

If one of the graphs has a single isolated vertex and the other of them has at least 
two, then they are distinguished in S2nF0^ with quantifier depth 3 (Spoiler pebbles 
2 isolated vertices in one of the graph and wins in the next round). It remains to 
consider the case when both G and H have exactly one isolated vertex or when both 
of them have at least two isolated vertices. 

Remove all isolated vertices from G and H and denote the resulting graphs by 
G' and H' . Note that both G' and H' must contain normal subgraphs. By the 
induction assumption, either Duplicator survives in Ehr^(G',/7') arbitrarily long 
or Spoiler needs at most v{G') rounds to win the 1-jump version of EHR2(G",i/') 
where he is allowed to start the game in any graph at his discretion. In the former 
case. Duplicator survives arbitrarily long as well in EHR^(G',if): whenever Spoiler 
pebbles an isolated vertex. Duplicator does the same in the other graph. In the 
latter case. Spoiler succeeds in the same type of the Ehrenfeucht game on G and 
if, needing perhaps one extra move to win. He plays inside G' and H\ using his 
winning strategy in the game on these graphs. Playing always inside G' and H\ 
Duplicator loses. Suppose that she pebbles an isolated vertex in G or H. Since 
Spoiler's counterpart of this vertex in H' or G' is not isolated. Spoiler wins in the 
next move by pebbling its neighbor. Note that no extra jump is required in this 
situation. By the induction assumption. Spoiler wins the 1-jump Ehr^(G, if) in 
v{G') + 1 < v{G) rounds. ■ 

Theorem 7.2 For every i>l there are bicolored trees Gi and Hi such that 

1. v{G,)=v{H,), 

2. DlXGi,Hi)<i + 2, 

3. DlXG„Hi) = 00. 

The proof of Theorem 17.21 is a reminiscence of an earlier work on logical com- 
plexity of trees in [H [29]. The same idea was used by Oleg Pikhurko (personal 
communication, 2007) to construct, for each i, a sequence of pairs of trees Gj and 
Hj such that Dj^^^Gj, Hj) = 0(1) while D-^.^Gj, Hj) — > 00 as j — )■ 00. 

Proof. We use induction on i. The graphs Gi and iii, both being trees with 7 
vertices colored in apricot or blue, are shown in Fig. |H We have D'^{Gi, Hi) < 2 
because Spoiler wins 'E}iR'^{Gi, Hi) by pebbling two blue vertices in Gi. We have 
Dy{Gi, Hi) = 00 because Duplicator has a simple winning strategy in EHR3(iii, Gi). 
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Figure 4: Proof of Theorem 17. 2[ The recursive construction of the sequence of 
graphs Gi and Hi. 

We designate the vertices of degree 3 in Gi and Hi as roots. Both Gi and Hi have 
three leaves of degree 1. The other three vertices will be called stalks. In general, 
we will use the latter term for any vertex adjacent to a leaf. If we cut off the leaves, 
the truncated Gi and Hi still satisfy Conditions 1-3 of the theorem. However, this 
exra level of vertices is needed in the recursive construction of Gi and Hi for i > 2. 

Suppose that the graphs and rooted trees with the same number of 

vertices colored in apricot or blue, are already constructed. The graph Hi consists 
of three disjoint copies of Gi^i and a new vertex colored in blue, designated as root 
in Hi, which is adjacent to the root of each Gj_i-part. The graph Gi is constructed 
similarly but, instead of three Gj^i-branches, it has two Gj-i-branches and one 
ifj_i-branch; see Fig. HI Obviously, Gi and Hi have equal number of vertices. 

In order to prove the bound D^.{Gi, Hi) < z + 2, it suffices to design a strategy 
for Spoiler allowing him to win the Sj-version of EHR^(Gj, Hi) in i + 2 moves, where 
he begins playing in Gi and up to z — 1 times is allowed to jump from one graph 
to another. The following climbing strategy serves this purpose. Spoiler pebbles the 
root in the first round and then successively pebbles vertices along a path upwards 
arriving at a blue leaf (he is allowed to jump i — 1 times and continue such a 
path in the other graph). Note that exactly this strategy is successful for him in 
EnR^{Gi, Hi), where Duplicator is forced either to break adjacency or to pebble 
an apricot stalk. Using induction in i, we show that Spoiler can force the same for 
every z > 2 in z + 1 moves under the condition that also Duplicator all the time 
climbs up in the trees. 
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In the first round, when Spoiler pebbles the root of Gi, Duphcator is forced 
to respond with pebbling the root of Hi. Otherwise, if she responds with a blue 
non-root vertex u in Hi, Spoiler jumps to Hi and pebbles the vertices along a path 
starting at u and finishing at an apricot stalk. Suppose that Duplicator does respond 
with pebbhng the root of H^. 

In the second round Spoiler pebbles the root of the ifi_i-branch in Gi. Not to 
lose immediately. Duplicator responds with pebbling the root of one of the Gi-i- 
branches in Hi. Spoiler jumps to Hi and, starting from the third round, plays 
the Sj_i-game Eur" {Gi.,,Hi_i) using the climbing strategy. If Duplicator always 
climbs up. Spoiler wins in the next i — 1 rounds by the induction assumption. 

There remains a possibility that Duplicator at some moment makes a move 
downwards. Then Spoiler, staying at a blue vertex w, starts pebbling the vertices 
along a shortest path from w to an apricot vertex (no extra jump is needed in this 
case). Duplicator loses because the distance from her standpoint to any apricot 
vertex is larger. Note that, if w is not a stalk. Spoiler wins by arriving at an apricot 
stalk; if w is a blue stalk. Spoiler pebbles the adjacent leaf. This is exactly the 
reason why we need an extra level of apricot vertices in Gi and Hi. 

It remains to prove that D^_{Gi, Hi) = oo. We describe a strategy for Duplicator 
in the Ilj-game EHR^(Gj,i7j) (where Spoiler begins playing in H^ and is allowed to 
jump from one graph to another at most i — 1 times) and prove by induction on i 
that the strategy allows her to resist arbitrarily long. An important feature of this 
strategy is that Duplicator will always respects tree levels, that is, the distance to 
the root. Playing EHRy(G'i, Hi), Duplicator is able to respect tree levels and vertex 
colors simultaneously, so we are done in the base case. 

Let i >2 and assume that Duphcator has a successful strategy for the Ilj.i-game 
EHR^(G'j_i, Hi_i). In the course of the game, we will call a branch of Gi or Hi free if 
it currently contains no pebbled vertex. We will call a branch of Gi and a branch of 
Hi linked if they contain vertices pebbled in a round. Duplicator plays the Ilj-game 
EHR^(Gj,i7i) as follows. 

• If Spoiler pebbles the root in Gi or Duplicator pebbles the root in the 
other graph. 

• If Spoiler moves in a free Gj-i-branch of Gi or ifj. Duplicator mirrors his move 
in a free Gj-i-branch in the other graph. Note that this is always possible 
because both Gi and Hi have at least two Gi_i-branches. 

• If Spoiler moves in the iJj_i-branch of Gi when it is free. Duplicator re- 
sponds in a free Gi_i-branch of Hi according to her strategy in the Ilj-i-game 

EHR2(Gi_i,if,_i). 

• If Spoiler places a pebble in a branch containing a vertex pebbled by the other 
pebble, Duplicator responds in the linked branch. If the linked branches are 
both copies of Gj-i, Duplicator mirrors Spoiler's move; otherwise she follows 
her strategy in the Ilj.i-game EHR^(Gj_i, ifj_i). 
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Note that this strategy is level-preserving because, by assumption, so is the strategy 
on Gi^i and Hi_i. 

Let us argue that Duplicator never loses the Hj-game on Gi and Hi. The strategy 
description ensures that her move in the first round is non-losing. Suppose that pairs 
of vertices {u, v) and (a, b) are pebbled in two successive rounds. Assume first that 
u and V are the roots of Gi and Hi. To prove that Duplicator does not lose, we have 
to check that adjacency and colors are preserved. Indeed, we have a ~ -u iff 6 ~ f 
because the strategy is level-preserving. Furthermore, a and b have the same color 
because, if they both are in Gj_i-branches, then Duplicator played by mirroring; 
while if a is in the ifj_i-branch of Gi, then Duplicator actually plays the first round 
of the game on Gj-i and (using a non-losing strategy). 

Assume now that u and v are non-roots. The case when Spoiler makes a move, 
a or b, in a free branch of Gi or Hi is tackled by the same argument as above. This 
case is even a little simpler because we clearly have a ^ u and b ^ v. It remains 
to consider the case when u and a are in the same branch of Gj, and v and b are 
in the same branch of Hi. If these branches are copies of then adjacency and 
colors are preserved because Duplicator played by mirroring. If u and a are in the 
ifj_i-branch of Gj, then Duplicator is alive because she uses a non-losing strategy 
for EHR^(Gj_i, A crucial observation here is that the game on and 

i?j_i-subgraphs can start only after Spoiler jumps from Gi to Hi. As one jump is 
aleady used, this is a Ilj.i game, and the induction assumption does apply. ■ 

Theorem 17.21 generalizes to any /c- variable logic and, ii k > 2, then no vertex 
coloring is needed any more. 

Theorem 7.3 Let k > 3. For every i > 1 there are uncolored trees Gi and Hi such 
that 

1. v{Gi)=v{Hi), 

2. Dl^{Gi,H,) <i + 5, 

3. Dl,^{Gi,H,) = oo. 

Proof. If colors (apricot and blue) are allowed, then the construction of Gi and Hi 
from the proof of Theorem 17. 2l can easily be adapted to suit k variables. Moreover, if 
there are more than two variables, the levels of apricot vertices in Gi and Hi are not 
needed any more. The only their purpose was to ensure Spoiler's win in the situation 
when the players arrived at vertices Ui G V{Gi) and vi G V{Hi) immediately before 
the stalk level, after which Spoiler pebbled a blue stalk, say, V2 G V{Hi), while 
Duplicator pebbled a blue vertex U2 G ^^(^2) in the lower level (because all stalks 
adjacent to ui were apricot). Now, that Spoiler has a third pebble, he puts it on 
V3, the other blue neighbor of Vi. Since Ui has only one blue neighbor. Duplicator 
loses. 

For each > 3, we now define trees Gi and Hi as follows. Let Hi consist of a 
blue root- vertex and k + 1 apricot leaves. Gi is obtained from Hi by recoloring one 
leaf in blue. Furthermore, Hi consists of the blue root and k + 1 Gj_i-branches. Gi 
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Figure 5: Proof of Theorem 17.31 The trees for 3- variable logic are still colored. 











Figure 6: Proof of Theorem 17.31 The uncolored versions of Gi and Hi for 3- variable 
logic. 



is obtained from Hi by replacing one Gj_i-branch with an iJj_i-branch; see Fig. [5l 
Except the aforementioned situation, analysis of Sj- and Hj-games on Gi and Hi is 
completely similar to the proof of Theorem 17.21 

In order to remove colors from Gi and Hi, we grow two extra levels of vertices in 
these trees. Specifically, we connect each apricot leaf a to a new vertex ai, which is 
adjacent to another new vertex 02. Each blue leaf h is connected to two new vertices 
hi and 62, which are non- adjacent to one another; see Fig. |6l Note that the modified 
graphs have vertices of four kinds: of degree 1 (all those are new vertices), of degree 
2 (which are of two sorts: old apricot vertices and adjacent new vertices), and of 
degree at least 3 {old blue vertices). 

In order to win the Sj-game on the modified uncolored graphs. Spoiler uses the 
climbing strategy as in the proof of Theorem 17.21 In at most i + 2 moves he forces 
pebbling a pair of old vertices that had different colors. Here we have two cases: 
Either Spoiler uses up all jumps and pebbles an old blue vertex while Duplicator 
pebbles an old apricot vertex (which includes the new situation discussed above), 
or Spoiler has at least one jump left over and pebbles an old apricot vertex while 
Duplicator pebbles an old blue vertex. Assume for a while that Spoiler can use 4 
pebbles. In the former case he wins by pebbling 3 neighbors of the old blue vertex; 
in the latter case he makes a remaining jump and wins in the same way. 

It remains to explain how Spoiler can win with 3 pebbles after an old blue vertex 
u (now of degree al least 3) and an old apricot vertex v (now of degree 2) are pebbled 
in a round. He first pebbles two neighbors ui and U2 of u. Duplicator must respond 
with the two neighbors vi and V2 oi v. In the next round Spoiler moves the pebble 
from u to its third neighbor M3. Duplicator must remove the pebble from v and place 
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it on some vertex ^3 non-adjacent to both vi and V2- Note that, while the distance 
between any two vertices of Ui, U2, and M3 equals 2, there is a pair of indices s and 
t such that Us and Ut are at the distance more than 2. Spoiler now wins by moving 
the pebble from Ug to u, where {q} = {1, 2, 3} \ {s, t}. 

Finally, we have to check that Duplicator survives arbitrarily long in the Hj-game 
on the modified uncolored graphs. This can be shown by induction on i, much the 
same as in the proof of Theorem 17.21 ■ 

Corollary 7.4 Let k > 2. The quantifier alternation hierarchy for FO^ over colored 
trees is strict: For every i there is a a class of colored trees T such that some sentence 
in Sj nFO*', if considered over colored trees, defines T while no sentence in IljnFO^ 
does it. Moreover, if k > 3, this holds true even for uncolored trees. 

Proof. Let Gi and Hi be colored trees as in Theorem 17.21 if A; = 2 or uncolored 
trees as in Theorem 17.31 if k > 3. Let $ G Sj fl FO'^ be a sentence distinguishing Gi 
from Hi. Define T to be the class of all trees satisfying Note that Gi eT while 
Hi ^ T. By this reason, no sentence in Ilj fl FO'^, if considered over trees, defines T 
(since it cannot distinguish Gi from Hi). ■ 



8 Questions and comments 

We now state a few remaining problems, focusing on the "smallest" particular cases 
that are open. 

Let G and H be colored graphs with v{G) = v{H) = n. We have D'^{G,H) < 
D^^{G,H) < Dl{G,H), where D\G,H) < n if it is finite and D^{G,H) can be 
finite but large, namely f2(?T,^). Where is D^~^{G,H) in this range (between 0{n) 
and n{n^))7 

Furthermore, is it true that there exists k, say k = 3, such that if D^{G, H) < 00, 
then we have D^{G,H) = 0{\ogn)7 In other words, do we have an exponential 
speed-up in the quantifier depth if we increase the number of variables? 

If we initially have 3 variables, a negative answer follows from a result of Immer- 
man [20] who constructed pairs of colored graphs G and H such that D^{G, H) < 00 
while Dpo{G, H) > 2^^°^^~^. However, the following modification of the question 
seems open: Do we have a speed-up under the stronger condition that G is definable 
in 3-variable logic, that is, D^{G,H) < 00 for every H non-isomorphic to G. 

Questions of this kind are related to the graph isomorphism problem. Assume 
that there is a function /, for example f{k) = 3k, such that if a graph G on n 
vertices is definable in FO|l, then it is definable in FO^'''^^ with quantifier depth 
0(log?T,). Such a speed-up result would imply that isomorphism of graphs excluding 
a fixed minor could be tested in NC. This implication follows from two known 
facts. First, graphs excluding a fixed minor M are definable in FO^ for some fixed 
k = k{M) (Grohe [lElITT]), and hence distinguishability in this logic coincides with 
isomorphism within this class of graphs. Second, distinguishability with logarithmic 
quantifier depth is decidable in NC (Grohe and Verbitsky [18j). 
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The best lower bound for D^{G,H), obtained by Fiirer [IT], is linear. In fact, 
Fiirer's construction demonstrates a rich dynamical behavior. For each h > 1, 
he constructs a sequence of pairs of graphs Gn and Hn with f = Q{n) and 
v{Hn) = 0(n) such that 



oo ii k < h, 

Q{n) ifh<k<2h, 

e(v^) if fc = 2/1 + 1, 

^O(logn) if A; = 3h. 



It is an interesting question whether such a behavior is "typical" . 
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